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Abstract 

We investigate individual packet delay in a model of data networks with table-free, 
partial table and full table routing. We present analytical estimation for the average 
packet delay in a network with small partial routing table. Dependence of the delay 
on the size of the network and on the size of the partial routing table is examined 
numerically. Consequences for network scalability are discussed. 



1. Introduction 



Importance of packet-switched data networks in contemporary society cannot be overesti- 
mated. In an attempt to understand their complex dynamics, several simplified models have 
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been proposed in recent years 0, [T3|, |1^, |1^, The construction of these models have 
been inspired by successful and well established in physics methodologies of particle systems, 
cellular automata and lattice gas cellular automata. The application of these methodologies 
in the context of data networks provides a promising alternative approach. Even though 
some of these models are simplistic, they can be expanded and modified to incorporate var- 
ious realistic aspects of data networks. Additionally, these models are not only amenable to 
computer simulations but also to obtaining analytical results. 

One of the interesting questions which needs to be addressed in the context of these 
models is an issue of influence of the randomness present in the routing algorithm on the 
network's dynamics and its effects on the performance of the network. 

In we investigated a model in which packets are routed according to a table stored 
locally at each node. If the table includes all other nodes of the network, such an algorithm 
is called full table routing algorithm. However, if only nodes closer than m links away are 
present in the table (partial table routing), packets with a destination address not present 
in the table are forwarded to a randomly selected nearest neighbour node. This introduces 
certain amount of randomness or noise into the system, and as a result, the delay changes. 
By delay we mean the time required for a packet to reach its destination. 

In this work, we will investigate how the delay experienced by a single packet, when no 
other packets are present, depends on the degree of randomness in the routing scheme. While 
interactions with other packets will obviously strongly influence the delay, in we found 
that the delay experienced by a single packet is an important parameter characterizing the 
network. For example, simulation experiments reported in ]13[ seem to indicate that in many 
cases the critical load is inversely proportional to the single packed delay. In an attempt 
to gain some insight into properties of this important parameter, we will derive analytical 
estimates for the single packet delay and compare it with direct simulations. Finally, we will 
discuss how these results affect scalabihty of the proposed network model. 



2. Network Models Definitions 



Detailed description of the network model is given in ||T3[. Here, we summarize only its main 
features. The purpose of the network is to transmit messages from points of their origin to 
their destination points. In our model, we will assume that the entire message is contained 
in a single "capsule" of information, which, by analogy to packet-switching networks, will 
be simply called a packet. In a real packet-switching network, a single packet carries the 
information "payload" , and some additional information related to the internal structure of 
the network. We will ignore the information "payload" entirely, and assume that the packet 
carries only two pieces of information: time of its creation and the destination address. 

Our simulated network consists of a number of interconnected nodes. Each node can 
perform two functions: of a host, meaning that it can generate and receive messages, and 
of a router (message processor), meaning that it can store and forward messages. Packets 
are created and moved according to a discrete time parallel algorithm. The structure of the 
considered networks and the update algorithm will be described in subsections which follow. 
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2.1. Connection Topology 



In this paper, we will consider a connection topology in a form of a two-dimensional square 
lattice with periodic boundary conditions D' . The network hosts and routers are located at 
nodes of the lattice CP . The position of each node on a lattice C? is described by a discrete 
space variable r, such that 

r = zc^+jCj^, (1) 

where Cj,, are Cartesian unit vectors, and z, j = 1, . . . , L. The value of L gives a number 
of nodes in the horizontal and vertical direction of the lattice D' . We denoted by C(r) the 
set of all nodes directly connected with a node r. For each r G C? ^ the set C(r) is of the 
form 

(2) 



C(r) = {r - c^, r + c^., r - Cy, r + c^^}. 



In this case, the node r is connected with its four nearest neighbours. In the networks 
considered here, each node maintains a queue of unlimited length where the arriving packets 
are stored. Packets stored in queues, at individual lattice nodes, must be delivered to their 
destination addresses. To assess how far a given packet is from its destination, we introduce 
the concept of distance between nodes. We will use periodic "Manhattan" metric to compute 
the distance between two nodes ri = (zi,ji) and = {^2,32)'- 



L- 



«2 - ^1 



|j2 - ill - 



(3) 



2.2. Update Algorithms 

The dynamics of the networks are governed by the parallel update algorithms similar to the 
algorithm used in [|16]. We start with an empty queue at each node, and with discrete time 
clock k set to zero. Then, the following actions are performed in sequence: 

1. At each node, independently of the others, a packet is created with probability A. Its 
destination address is randomly selected with uniform probability distribution among 
all other nodes in the network. The newly created packet is placed at the end of the 
queue. 

2. At each node, one packet (or none, if the local queue is empty) is picked up from the 
top of the queue and forwarded to one of its neighboring sites according to a one of 
the routing algorithms to be described below. Upon arrival, the packet is placed at 
the end of the appropriate queue. If several packets arrive to a given node at the same 
time, then they are placed at the end of the queue in a random order. When a packet 
arrives to its destination node, it is immediately destroyed. 

3. is incremented by 1. 

This sequence of events, which constitutes a single time step update, is then repeated 
arbitrary number of times. The state of the network is observed after sub-step 3, before 
clock increase and repetition of sub-step 1. In order to explain the routing algorithms 
mentioned in sub-step 2, we will first describe one of its simplified versions. 



3 



Let us assume that we measure distance using metric dpM- To decide where to forward 
a packet located at a node r with the destination address r^, two steps are performed: 

1. From sites directly connected to r, we select sites which are closest to the destination 
Td of the packet. More formally, we construct a set Aoo(r) such that 

^oo(r) = {a G C(r) : d{a, r^) = min dpA/(x, r^)} (4) 

xeC(r) 

2. From Aoo{r), we select a site which has the smallest queue size. If there are several 
such sites, then we select one of them randomly with uniform probability distribution. 
The packet is forwarded to this site. Using a formal notation again, we could say that 
the packet is forwarded to a site selected randomly and uniformly from elements of a 
set -Boo(r) defined as 

-Boo(r) = {a G Aoo(r) : n(a, /c) = min n(x, /c)}, (5) 

xeAoo(r) 

where ?t,(x, k) is a queue size at a node x at time k. 

To summarize, the routing algorithm Rqo described above sends the packet to a site which 
is closest to the destination (in the sense of the metric dpM), and if there are several such 
sites, then it selects from them the one with the smallest queue. If there is still more than 
one such node, random selection takes place. It is clear that each packet routed according 
to the algorithm Rqo will travel to its destination along the shortest possible path (shortest 
in the sense of the metric dpM-, not necessarily in terms of a number of time steps required 
to reach the destination). In real networks, this does not always happen. In order to allow 
packets to take alternative routes, not necessarily shortest path routes, we will introduce a 
small modification to the routing algorithm Rqo described above. 

The modified algorithm R^, for each node r, will use instead of the set Aoo(r) a set 
Am(r) defined as follows. In the construction of the set Am(r) instead of minimizing dis- 
tance dpM{^iT^d) from X to the destination r^, as it was done in (^, we will minimize 
6m((^PM(x,rd)), where 

0™(?/)=l^' 'ir"' (6) 

y m, otherwise, ^ ' 

for a given integer m. Thus, the definition of the set Am{r) is 

Am{r) = {a G C(r) : 6m((ipM(a, r^)) = min 6m((ipM(x, r^))}. (7) 

x6C(r) 

The above modification is equivalent to saying that nodes which are further than m distance 
units from the destination are treated by the routing algorithm as if they were exactly m 
units away from the destination. If a packet is at a node r such that all nodes directly linked 
with r are further than m units from its destination, then the packet will be forwarded to a 
site selected randomly and uniformly from the subset of C(r) containing the nodes with the 
smallest queue size in the set C(r). It can happen that the selected site can be further away 
from the destination than the node r. 
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Therefore, introduction of the cutoff parameter m adds more randomness to the network 
dynamics. One could also say that the destination attracts packets, but this attractive 
interaction has a finite range m: packets further away than m units from the destination are 
not being attracted. 

It is also possible to relate various values of the cutoff parameter m to different types of 
routing schemes used in real packet-switching networks. Assume that each node r maintains 
a table containing all possible values of (ipjv/(x, r^^), for all possible destinations r^^ and all 
nodes x G C(r). Assume that packets are routed according to this table by selecting nodes 
minimizing distance, measured in the metric dpM, traveled by a packet from its origin to its 
destination. Such a routing scheme is called table-driven routing [|17| and it is equivalent to 
the routing algorithm Hoo- In this case, construction of the set Aoo{r) would require looking 
up appropriate entries in the stored table. 

Let us now define D^ax to be the largest possible distance between two nodes in the 
network. When m < Dmax, then for a given x, we need to store values of (ipM(x, r^) only 
for nodes which are less than m units of distance away - for all other nodes distance does 
not matter, since it will be treated as m by the routing algorithm. Hence, at each node r the 
routing table to be stored is smaller than in the case when m = D^ax- The routing scheme 
based on this smaller routing table is called the reduced table routing algorithm |17| and it is 
equivalent to the routing algorithm R^. In the case when m = Dmax the routing algorithm 

Rm Roo ■ 

Finally, when m = 1, the distances between hosts and destinations are not considered 
in the routing process of packets. Therefore, there is no need to store any table of possible 
paths at nodes of the network. This case corresponds to the table-free routing algorithm 
in which packets are routed randomly. Hence, this algorithm can send packets on circuitous 
and long routes to their destinations. 



3. Single packet delay 

One of the quantities characterizing the performance of a network is a packet delay r^, 
frequently used in network performance literature p|, ^ |, ||, |15|, |19|]. In our case, the 



delay will be defined as a number of time steps elapsed from the creation of a packet to 
its delivery to the destination address when the routing algorithm is used. In ||13[ we 
found that the free packet delay, or delay experienced by a packet when no other packets 
are present, strongly determines behavior of the network, in particular transition point to 
the congested state. Since in the case of a single packet there is no interaction with other 
packets, mathematical analysis of packet's dynamics is considerably simpler. This analysis 
will be performed in what follows. 

First of all, let us note that when the routing algorithm R^ is used, and when the packet 
is further than m units away from its destination address, it performs a random walk until it 
hits a node which is m units away from the destination, and then it follows the shortest path 
to the destination. Obviously, several shortest paths might exists, so there is still randomness 
in the packet's motion, but every time step its distance from the destination decreases by 
one unit. 

Let us denote by rm(ro, r^) the expected delay time experienced by a packet which starts 
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at ro and has destination address r^. For a lattice with periodic boundary conditions, only 
relative position of tq and is important. Therefore, we will choose r^i to be at the origin, 

and define r„(ro) = r„(ro, 0). 

From our discussion of the packet's motion we conclude that Tm{rQ) is a sum of two parts: 

Tm(ro) = T^,i(ro) + T^,2(ro), (8) 

where Tm,i{ro) is the expected time for a random walk to hit a node which is m units away 
from the origin, and Tm,2{ro) is the expected time to reach the origin starting from the node 
which is m units away from the origin. We will call T^,i(ro) a random part, and Tj„,2(i"o) ^ 
semi-deterministic part of the delay ^"^(ro). 
Obviously, for a single packet in the network 

Tm,2(ro) = Om(dpM{ro, 0)), (9) 

and it is only rm,i(ro) that needs to be computed (if m < Dmax)- It turns out that by 
modifying the problem slightly, an analytical estimation of rm,i(ro) can be obtained. 

3.1. Analytical estimation of the expected hitting time for a random walk on 
a lattice /y. 

First, we observe that for a random walk which start at Tq, Tm^i{ro) is the expected time of 
hitting the circle 5'm(0, dpM) = {r e £^ : dpui^-, 0) < m} 

While the circle S'm(0, dpM) defined in dpM metric is a natural one to be used in our 
network model, it is not well suited for the estimation of Tm,i{jo)- In order to carry such 
estimation, we will replace the circle SmiS^-, dpu) by the circle S'rre(0, dpE) in Euclidean metric, 
as explained below. 

For any two points ri = and Y2 = (3:2,1/2) in jC^ let us define the Euclidean 

distance with periodic boundaries between this two points as 

dpE{ri,r2) = \J (min{a;i -X2,L- {xi - X2)}Y + (min{yi - 1/2, L - (jji - 1/2)})^. 
Notice that this metric is equivalent to the periodic Manhattan metric dpM, in particular 

-^dpM{ri,Y2) < dpE{ri,r2) < c?pM(ri,r2). 

For r e D' let us set 

||r|| = dpE{r,0). 

Hence, for any a > 0, the circle of radius a is the set 

Sa^Sa{0,dpE) = {reC^: \\r\\<a}. 

Consider a simple random walk {Xk}, A; = 0, 1, 2, ... on >C^. Let TR(r; L) be the expected 
time of hitting the circle Sr on a lattice /y when the random walk {Xk} starts at Xq — r. 
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Theorem 3.1 Suppose that R{1 + e) < L/4 and R < ||r|| < L/4. // the random walk {Xk} 
starts at r then there exist a constant C = C{e) > such that 



Tn{T,L)>CL'\ogr-^ 



L RHog{\\r\\/R)^ 



(10) 



where we write y{x) = 0{x) whenever sup^^Qy{x)/x < oo. 



The proof of this theorem is based on the fohowing lemma. Consider two numbers a and c 
such that < a < c < L/2 and suppose that Xq = r with ||r|| = 6 G (a, c). Clearly, Sa C Sc, 
Xq e Sc and Xq ^ Sa- Let Pa,c(r) be the probability that the random walk {Xk} will hit the 
circle Sa before exiting Sc- 

Lemma 3.2 ///(r) = log(||r||^ + 1), then 

f{c)-f{b) log{c/b) + 0{l/b') 



Pa,c(r) < 



/(c) -/(a) log(c/a)+0(l/a2) 



Proof of the Lemma. The proof is conducted in the spirit of |T^, the reader can also find in 
this book the definition of submartingale and stopping time used further in this paper. 

Observe that C,k = f{Xk) is a submartingale with respect to a filtration JF^ = a{Xo, Xi, . . . , Xk) 
generated by the random walk {Xk}. Indeed, simple algebra shows that 

^\og{{x + iy + y^ + l) + ilog((x-l)2 + y2 + i) + ^iog(x2 + (2/ + i)2 + i) 

+ ^ \og{x^ + {y- If + 1) > log(x' + y^ + l) 



and therefore 



Let the stopping time 



r] = mf{k > : Xk e Sa or Xk e C'XS^} 



be the first time when the random walk leaves Sc\Sa. Then ^k = ^fcAr? is also a submartingale 
0, therefore 

E4>Eeo = /(&) (11) 

for all k. Obviously, rj is finite a.s., so ^fc converges in to C,r, [§] . On the other hand, f{Xrj) < 
f{a) if the random walk hits Sa before C^\Sc and f{Xrj) > f{c) otherwise. Consequently, 

E[f{X,)\X,eSa] < f{a), 
E[/(X,)|X,^5J > /(c). 

Since /(a) < /(6) and 

E (e,) = E /(X,) = E [/(X,) I X, G Sa]paA^) + E [f{X,) I X, ^ Sc]{l - Pa,c(r)), 
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E[/(X,) 


X, i - fib) 




X,^5J-E[/(X,) 





the inequality ([TT]) yields 

Pa,c(r) < 

^ E[/(X,)|X,^g,]-/(&) ^ f{c)-f{b) 

- E[/(X,)|X,^5e]-/(a) - /(c)-/(a)- 

Using the expansion log(a^ + 1) = 2 log a + 0(1/ a) applied to a, b and c we conclude the 
proof of the Lemma. Q.E.D. 




Figure 1: Illustration to the proof of the Theorem 3.1 



Proof of Theorem \3. 1[ The proof will proceed in three steps. First, we will obtain 
the upper bound on the probability of reaching Sji prior to leaving 6*^/2-1 when a random 
walk starts at a point r G G where G is the ring Sl/a\Sl/a-i- Next, we will estimate the 
expected time of reaching G starting from Cp\Sl/2-i- In the second step, we will show 
that the expected time of hitting Sr when the random walk originates inside G is of order 
L/^ \og{L / 2R) . Finally, we will use the fact that the expected time of hitting Sr when the 
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walk originates at some r with R < ||r|| < L/4 is at least as large as the product of the 
probability of hitting G prior to Sr and the expected time of hitting Sr starting from G. 
Step 1. 

Let G = Sl/4\Sl/4,-i be the set of lattice points inside the ring of "width" one. Consider 
for each r G G a simple random walk starting at r, and a probability PR,L/2-i(r) that 
the random walk starting at r will hit Sr before C^\Sl/2-i- Let p be smallest of these 
probabilities, that is 

p = mmpR^L/2-i{r), (12) 



then by Lemma 3.2 



p 



< 



log2 + 0(l/L) 



log(L/(2i?)) + 0(l/L + l/i?2)- 



Next, let us show that if the random walk starts in C^\Sl/2-i, then the minimum of all 
average times before hitting G is of order L^. Indeed, when the random walk hits some 
r = {x,y) G G, then dpE{r) < L/A and therefore both \x-L/2\ > L/A and \y-L/2\ > L/A. 
However, for any ri = (xi, yi) ^ 5'i/2-i at least one of the values Xi — L/2 or yi — L/2 lies 
inside the segment [-(^2 - l)L/4 - 1, {y/2 - l)L/A + 1] (see Fig. |]T]). Consequently, the 
time in which the simple random walk hits G is stochastically larger[| than U, the random 
variable representing the time in which one- dimensional simple random walk originating in 
X G [-(2 - v/2)L/4 - 1, (2 - v/2)L/4 + 1] leaves the segment [- [L/4J , [L/A\f\. The expected 
value of this random variable is known (see [0) and equals 

{[L/A\f -x^ >CiL^ (13) 

for some constant Ci > 0, because (2 — V2)/A < 1/4. 

Step 2. Let u = z/(r) = mi{k : G Sr} denote the first time when the random walk 
starting at Xq = r G G hits the circle Sr. Consider a stopped random walk Xk = XkAu with 
Xo = Xq. Set rjo = and let 

r]n = inf{fc > r]n-i : Xk e G and Xk' ^ Sl/2-i for some k' G (%-i, k)} 

for k = 1,2, . . .. Thus, r/^'s are consecutive times at which X^ finishes "a loop" from G to 
G visiting D'\Sl/2-i for some time. Since the random walk eventually hits Sr, only finitely 
many ry^'s will be defined. According to ([l^), the random number N of such loops before Xk 
hits C'^\Sl/2-i is stochastically larger than a geometric random variable N with parameter p 
defined by P{N > n) = {1 — p)", -n, = 0, 1, 2 . . .. The probability that the walk originating in 
G will visit C'^\Sli2 but will not visit 5*^, n times in a row is at least (1 — p)". Consequently, 

N N N 

i=l i=l i=l 



^One random variable is stochastically larger than another, if there is a probability space on which both 
random variables are simultaneously defined and with probability one the first one is at least as large as the 
other one. For further references, see Q. 

^By [a\ we mean the largest integer smaller than a. 
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where {Ui} is a sequence of random variables such that E {Ui \ N) > CiL^ in accordance 
with (plsl) . Since i/(r) > r^Tv, then for any r G G we obtain 



oo n 



Tn{r,L) = E z/(r) > Er^^ > ^ E f/, | iV = n)P(iV 



n] 



n=l 1=1 



oo T 
> C,L^Y.Ml-vT-' = ^>C,LHog^ 

p Art 



n=l 



i + o i 



1 



L i?2 log 



2R/ J 



where €2 = Ci/ log 2. 

Step 3. Now suppose that i? < ||r|| < L/4. By Lemma |3.1| , the event A = {X^. reaches 
G before hitting Sr} has the probability 



1 - PR,L/i{r) > 



/(||r||)-/(i?) log(||r||/i?) + 0(l/i?2 



/(L/4) - fiR) log(L/(4i?)) + 0(l/i?2 



Consequently, 



Tr{y,L) = Ez/(r) > E(z/(r) I A)P(A) > gminEz/(ri^ 

ri6G 



> CLHogi\\r\\/R) 



l + O - + 



L Rnog{\\r\\/R)^ 



smce 



log(L/(4i^)) log(l+e) 
log(L/(2/?)) -log(2 + 2e) ' 



and the Theorem is proven. 



Q.E.D. 



Corollary 3.3 Under conditions of Theorem \3. 1\ , if R is fixed while both ||r|| — >■ 00 and 
L 00, then 

Tnir,L)>CLHog{M/R)[l + o{l)]. 



3.2. The asymptotic behavior of Tj 



R 



Here we will study the case when L is so large, that a simple random walk after appropriate 
rescaling is close to a Brownian motion Bt on a square C = [0, 1]^ with periodic boundary 



conditions. Let 0<£:<l,rG£ and Ti;(r) be the expected time in which Brownian 
motion starting from r will hit a circle of radius e. To avoid a trivial answer, we always 
assume that r lies outside of this circle. When the rescaled random walk starting at r is 



close to the Brownian motion [12|, then for sufficiently large L and R 



Tn{r;L)^2L^fn/L{r/L). 



(14) 



Therefore, from bounds on Te{r) we can deduce the asymptotic behavior of Tf>{r, L). 
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It follows from |T[], p. 109, that the function Ti;{r) is a solution of the PDE on a square 
with periodic boundaries 

AT = -2, 
nr)Udc. = 0, 

where for any e > 0, dCe denotes the boundary of a circle of a radius e > around the 
origin 0. Here we will not be solving this PDE analytically. We will present estimates of T, 
which follow from a probabilistic nature of the model. The following statement is essential, 
the idea of its proof comes from 

Lemma 3.4 Consider a Brownian motion Bt on a plane starting from r G R^, such that 
p = |r| G (a, b) and < a < b. Let u = u{p; a, b) be the expected time until Bt hits the circle 
Ca, excluding the time spent outside the circle Cb, that is 

u=E l{\Bt\<b}dt 

where z/^ = inf{t : \Bt\ < a}, then 

a 2 



u{p] a, b) = b^ log - — - — - — . (15) 



Proof. For a Brownian motion Bt with Bq = r = {x,y) G Cb and u{r) = Ez/(r) we define 
z/(r) = Jq'' l!^\Bt\<b}dt. Consider a circle of a small radius po around r. Since u{r) is a constant 
on dCb, then from the symmetry of a circle and by Markov Principle 



1 /■02 

^(^) = 1 1- / ^(^ + Po cos 4>,y + Po sin 

02 - 01 J't'i 

I r<j>i+2Tr 



+ 9 — ^-rr I U{v)dct> + 0{p^). 
27r — 02 + 01 J4>2 

In this equation the angles 0i and 02 are defined in such a way that G (0i, 02) corresponds 
to the points (x + pocos0, ?/ + posin0) lying inside the circle Cb and G (02, 0i + 27r) 
corresponds to the points lying outside of the circle Cb- Taking a Taylor expansion and 
letting Po — yields 

Viz(r)-n(r)|reac, = 0, (16) 

where n is a unit vector normal to dCb at r. On the other hand, for r lying inside the set 
{r : |r| < 6} we have 

Am = -2 (17) 

(see P). Solving PDE ( p!7D with the boundary conditions (|T6]) and the condition u(r)|rgac'a = 
0, we obtain (|15D. Q.E.D. 

Now, to get the desired estimates on T, observe that the geometry of the model implies 
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where p is the distance from r to in Euchdean periodic metric on C In particular, using 
the R.H.S. of this inequahty we obtain the following result. 



Corollary 3.5 Whenever ( [T^ j takes place, TR(r, L) is asymptotically hounded from above 
by 

LHog^^-{\\rr-R') + o{L'). (18) 

In terms of order, this equation matches closely the lower bound given by (]I(]|). This is 
consistent with our results for the discrete case and not really surprising, since the limit of 
a random walk is a Brownian motion. 

3.3. Numerical results 

In order to assess quality of analytical estimates of Tji{r) = Tji{r, L) obtained in the previous 
section, we will compare them with values of Tii[y) calculated numerically by solving the 
system of hnear equations 

Tr{v) = 1 + i(TH(r + c,) + TR{r - c,) + TR{r + c,) + T^(r - c,)), (19) 

with periodic boundary conditions and ^^(r) = for every r G £^ such that dpEijc.Qi) < R. 
Figure |^a is a semi- log plot of TR(r) as a function of | |r| | for the lattice L x L = 50 x 50 and 
two values of R, R = 1 and R = 5. Each lattice node for which ||r|| > i? is represented by 
a single point on the graph. One can clearly see that for ||r|| smaller than about 10, these 
points form a straight line, in agreement with estimations (p!0| ) and (0). 
Once we notice that for every r G £^ such that dpM{r, 0) = m we have 

m\/2 

-^<dpE{r,0)<m, (20) 

we can obtain the following bounds on Tm,i{r): 

T^(r)<r„,i(r)<7^„^/2(r). (21) 

The above relationship is well illustrated in Figure Qd, which shows a graph of Xm,i(r) as a 
function of dpM{r, 0) for L = 50 and m = 1, 5. As before, the values of Tm,i(r) were obtained 
by solving the system of linear equations 

rm,i(r) = 1 + ^(rm,i(r + c^;) + rm,i(r - c^) + rm,i(r + Cy) + rm,i(r - c^^)), (22) 

with periodic boundary conditions and i(r) = for every r G £^ such that dpuij, 0) < m. 
In the aforementioned figure, the points close to the origin do not lie on a straight line, but 
lie in an area bounded by two straight lines, as expected from (pl|). 
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1 2 3 4 5 6 7 8 910 20 30 



a) 




1 2 3 4 5 6 7 8 910 20 30 40 50 



b) dpM(r,0) 

Figure 2: Graphs of (a) Tji{r, 50) as a function of | |r| | for = 1, 5 and (b) r^(r) as a function 
of dpM{i^, 0) for m = 1, 5 for a lattice /y with L — 50. Continuous hues are the least square 
fits using points with ||r|| < 10. 
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4. Average delay 



In a network model investigated in [|T3|, packets were created at each node with a destination 
address randomly selected among all nodes of the lattice. A useful quantity characterizing 
delay experienced by packets under such circumstances is an average delay Tm, defined as 

^m = ^J2 (23) 

Similarly as in (^), we can write the average delay as a sum of the average random and 
the average semi-deterministic parts, denoted by Tm,i and Tm,2, respectively. 

Using (P), we will calculate the average semi-deterministic part of the average delay. 
First, let us define N{k) to be a number of sites r E such that (ipM(r, 0) = k, < k < L. 
Then we can write Tm,2 as 

Tm,2 = ^ E ^-,2(r) = E N{k)eUk). (24) 

For simplicity, and without much loss of generality, in what follows we will assume that L is 
even. It is straightforward to establish that for even L 



N{k) 



1 if A; = 

Ak iiO<k<L/2 

2L-2 iik = L/2 (25) 

4(1 -k) iiL/2<k<L 

1 iik = L 



which can written in a more compact form as 

N{k) = 5o,k + 5L,k - 25L/2,k + 2L- \Ak - 2L\, (26) 

where 6ij = 1 if i = j and 6ij = otherwise. Using this result and computing the sum in 
(|2^ ) we obtain 

2m^ + m ^ ,^ 

m — , it m < L/2 



3L2 

L 2{L-mf + L-m 



rm,2={ , ,3^r (^7) 

i'^ -J- /, — m 

otherwise. 



2 3L2 

Since the average semi-deterministic part of the average delay is always smaller than m, 
for small m it will be negligible compared to the random part. Therefore, in the small m 
regime, we can expect that the leading term in Tm is a linear function of log(m), according 
to our analytical estimate from the previous section. Figure ^ shows that it is indeed the 
case, as illustrated for L = 100. An important observation which can be made from this 
figure is that stays close to its m = L value (Ti=L/2, see eq. when m is close to L. 
This means that making m slightly smaller than L does not increase delay significantly. 
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Figure 3: Average delay of a free packet as a function of m for a periodic lattice 50 x 50. 
The continuous line represents the least squares fit to the first 10 points. 



5. Network scalability 

Every network at some point of its life span needs to be expanded. It is obvious that as the 
number of nodes increases, the average delay increases as well, since the number of links to 
be traversed by a given packet becomes larger. However, the increase in delay, is not the only 
problem encountered when the network expands. Each node r stores a routing table, which 
in our model contains routing information for all nodes x G £^ such that (ipM(i",x) < m. If 
by M{m) we denote the number of nodes which arc up to m links away from a given node, 
we can say that the memory required to store the routing table is proportional to M(m), 
which can be readily computed: 



I 2m(m + l). ifO<m<L/2 
M(m) = y N(k) = { (28) 
^1 [ L2-2(L-m)(L-m-l)-2, if L/2 < m < L. 



Let us now assume that the "cost" of operating of a single node with routing algorithm 
is given by 

c(m, a) = r„, + aM{m), (29) 

where a is a nonnegative parameter describing the relative cost of memory vs. average delay. 
This cost function has been introduced to investigate strategies which could minimize both 
average delay and memory storage requirements at a node. The above form of c(m, a) simply 
means that the cost is a linear combination of memory used to store the routing table and the 
average delay experienced by packets. By using this form we want to express the fact that 
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Figure 4: Cost function as a function of a and m shown as a contour plot (a). Part (b) shows 
the graph of the cost function as a function of m for a fixed value of a (a = 1.58). 

the delay experienced by packets decreases utility of the network, and therefore increases its 
"cost" . 

Figure ^ shows how the total cost c(m, a) depends on m and a for L = 50. For any given 
value of a, one can find the value of m which minimizes the total cost, as shown in Figure 
|b. 

Obviously, when a is very small, i.e., when the cost of storage is negligible, the total cost 
is minimal at m = L. This means that if the delay alone is taken into consideration, full table 
routing is always a best choice. In that case, c(m, a) will increase with L as L^, meaning 
that the cost per node will grow proportionally to the number of nodes in the network. 

When a is large, the situation is very different. Let us assume, for example, that the 
value of a is large enough so that the value of m minimizing c(m, a) is small compared to L. 
In this case, the random part of is much larger than the semi-deterministic part, and we 
can assume that the leading term of has the form 

BL 

Tm ~ Ti^m = AL^ log , (30) 
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where A and B are constants independent of L, and therefore 



BL 

c(m, a) ^ AL'^ log h2am(m + l). (31) 

m 

The above cost function is minimized by 

which is an asymptotically linear function of L. This means the optimal strategy which 
should be used to minimize the "cost" of the network is to increase m proportionally to L, 
or in other words, to increase the size of the routing table proportionally to the number of 
nodes in the network. Note that in this case the cost will still grow with L, and for large 
values of L it will grow like L^, similarly as in the case of very small a. 



6. Conclusion 



We have investigated individual packet delay in a model of data networks with table-free, 
partial table and full table routing. We presented analytical estimates for the average packet 
delay in a network with small partial routing table and compared them with numerical 
results. We have also examined the dependence of the delay on the size of a network and 
on the size of a partial routing table. Assuming the total "cost" of a network with routing 
algorithm is a linear combination of memory used to store the routing table and the 
average delay experienced by packets, we discussed consequences of our findings for network 
scalability. If we are concerned primary with the speed of the network and the memory cost 
is not important, full table routing is the best choice. On the other hand, if the primary 
factor influencing the total cost is an amount of memory used to store routing tables, the 
optimal strategy which should be used to minimize the cost is to keep a size of a routing 
table proportional to a number of nodes in a network. In that case, the cost per node c(m, a) 
grows linearly with the size of the network. 
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